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Abstract —Before establishing a communication link in a cel¬ 
lular network, the user terminal must activate a synchronization 
procedure called initial cell search in order to acquire specific 
information about the serving base station. To accomplish this 
task, the primary synchronization signal (PSS) and secondary 
synchronization signal (SSS) are periodically transmitted in the 
downlink of a Long Term Evolution (LTE) network. 

Since SSS detection can be performed only after successful 
identification of the primary signal, in this work we present 
a robust scheme for joint PSS detection, sector index iden¬ 
tification and integer frequency offset (IFO) recovery in an 
LTE system. The proposed aigorithm relies on the maximum 
likelihood (ML) estimation criterion and exploits a suitable 
reduced-rank representation of the channel frequency response 
to take multipath distortions into account. We show that some 
PSS detection methods that were originally introduced through 
heuristic reasoning can be derived from our ML framework by 
selecting an appropriate model for the channel gains over the 
PSS subcarriers. 

Numerical simulations indicate that the proposed scheme 
can be effectively applied in the presence of severe multipath 
propagation, where existing alternatives provide unsatisfactory 
performance. 

Index Terms —LTE, cell search, sector index identification, 
integer frequency offset recovery. 

I. Introduction 

The Long Term Evolution (LTE) mobile communication 
standard has been developed by the 3rd Generation Partner¬ 
ship Project (3GPP) in order to enhance the performance of 
currently deployed 3G systems in terms of data throughput, 
spectrum utilization and user mobility JT). This technology 
supports various channel bandwidths ranging from 1.4 to 20 
MHz and promises peak data rates of 100 Mbit/s in the 
downlink (DL) and 50 Mbit/s in the uplink (UL) for the 
20 MHz system bandwidth Improved resilience against 
multipath distortion, high spectral efficiency and ability to 
handle different data rates are achieved by using orthogonal 
frequency-division multiple-access (OFDMA) in the DL, while 
single-carrier frequency-division multiple-access (SC-FDMA) 
is adopted in the UL due to its reduced peak-to-average 
power-ratio a. There is also the possibility of choosing 
between normal and extended cyclic prefix (CP), the latter 
being considered for large delay spread environments. 

LTE supports multi-cell communications, with cell informa¬ 
tion being conveyed by an integer number called cell-ID. Upon 
entering the network or during an handover operation, the 
user equipment (UE) must recover the cell-ID of the serving 
base station (eNodeB) and has also to acquire correct timing 
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and frequency synchronization. This operation is known as 
initial cell-search a, s and is accomplished by exploiting a 
dedicated synchronization channel (SCH) periodically inserted 
in the DL radio frame ©• The SCH conveys two signals called 
Primary Synchronization Sequence (PSS) and Secondary Syn¬ 
chronization Sequence (SSS). The former is generated from a 
63-length frequency-domain Zafoff-Chu (ZC) sequence whose 
root index univocally determines the sector identity. The latter 
is an interleaved concatenation of two length-31 scrambled 
//(-sequences specifying the cell ID group. 

Although in principle it is possible to consider the joint 
estimation of all synchronization parameters and cell-ID, a 
more pragmatic approach relies on the following three-stage 
procedure 0: 

1) Firstly, fractional frequency offset (FFO) and coarse 
symbol timing recovery is accomplished using the re¬ 
dundancy introduced by the CP. This method was origi¬ 
nally proposed in 0 and its accuracy can be improved 
by averaging the timing and frequency metrics over 
several OFDM symbols; 

2) In the second step, the UE detects the position of the 
PSS within the received DL signal in order to acquire 
subframe timing information, and also determines the 
sector index by identifying which primary sequence has 
been transmitted out of three possible alternatives. The 
integer frequency offset (IFO) can be retrieved at this 
stage by evaluating the shift of the received PSS in the 
frequency domain; 

3) The final step recovers the cell ID group and identifies 
the frame boundary by using the received SSS. Once 
these operations have been completed, the UE is able 
to read some basic configuration information broadcast 
from the eNodeB, such as the system bandwidth, CP 
length and duplexing mode. 

Since SSS detection can be accomplished only after suc¬ 
cessful identification of the primary sequence, PSS detection 
represents a crucial task in the overall cell search procedure. 
For this reason, it has attracted much attention in the last 
few years and many solutions are currently available. Some 
of them operate in the time-domain (TD), while others ex¬ 
ploit the frequency-domain (FD) samples provided by the 
receive discrete Fourier transform (DFT) unit. Examples of TD 
schemes can be found in m-m, where the PSS is revealed 
by looking for the peak of the cross-correlation between 
the received samples and the three locally regenerated ZC 
sequences. However, since the SCH is transmitted onto a set 
of 62 dedicated subcarriers with the other subcarriers being 
modulated by data symbols, the PSS should be extracted from 
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the DL signal before the correlation stage. This requires a 
high-order filtering operation of the received signal, which 
results into an increase of the hardware complexity and also 
leads to some performance degradation since the contribution 
of data subcarriers cannot be totally filtered out due to the 
spectral leakage and the uncompensated CFO. TD methods 
with reduced complexity are suggested in G2 and na, where 
the tasks of PSS detection and sector index recovery are 
decoupled by either exploiting the central symmetric-property 
of the ZC sequences or by correlating the DL signal with the 
sum of the three possible primary sequences. 

As an alternative to the TD approach, in FD schemes the 
contribution of the data-bearing subcarriers is eliminated by 
selecting the SCH at the receive DFT output and correlating 
the resulting samples with replicas of the three tentative 
frequency-domain ZC sequences m. This method works 
properly as long as the channel gain can be considered as 
approximately constant over the SCH subband. Unfortunately, 
such an assumption does not hold for transmissions over severe 
multipath channels or in the presence of a non-negligible 
timing error, which appears as a linearly increasing phase 
shift at the DFT output. In these circumstances, the correlation 
properties of the received PSS will be irreparably destroyed 
with an ensuing degradation of the system performance. To 
solve this problem, differential correlation in the FD has been 
proposed in lfl5l . where IFO detection is also accomplished 
during the PSS match process. A disadvantage of the differen¬ 
tial approach is that the peaks of the resulting metric are quite 
close for different IFO values, thereby reducing its ability to 
recover the frequency error m- Furthermore, compared to 
tm the accuracy of the FD differential correlator is greatly 
reduced whenever the channel frequency response (CFR) over 
the SCH is approximately constant. For this reason, the authors 
of El propose to adaptively choose between non-differential 
and differential detection on the basis of a previous estimate 
of the maximum channel delay spread. A simpler solution is 
found in m by resorting to the partial correlation concept, 
wherein the received SCH is partitioned into several adjacent 
FD segments which are subsequently correlated with the cor¬ 
responding parts of the tentative ZC sequences. A fundamental 
design parameter of this approach is represented by the number 
of segments, which must be selected in accordance with the 
coherence bandwidth of the transmission channel. 

It is worth noting that all the aforementioned methods have 
been derived by means of heuristic reasoning. In order to check 
whether their performance can be substantially improved or 
not, it is of interest to make comparisons with alternative 
approaches based on some optimality criterion. With this goal 
in mind, in the present work we employ maximum likelihood 
(ML) methods to study the problem of PSS detection, sector 
index identification and IFO recovery, which represents the 
second step of the cell search process in an LTE system. 
In contrast to previous investigations, our analysis, which 
originates from the work presented in (l9l . explicitly takes into 
account the multipath distortion introduced by the propagation 
medium on the received signal. This is achieved by treating 
the CFR over the SCH subcarriers as a nuisance vector l2Qj . 
which is jointly estimated along with the synchronization 


parameters. As we shall see, this approach may lead to an 
under-determined estimation problem wherein the number of 
quantities to be recovered exceeds the number of available 
data. Reduced-rank approximations of the CFR are proposed 
to cope with such a situation using either the minimum mean 
square-error criterion (MMSE) or the classical polynomial ba¬ 
sis expansion. This approach results into a general framework 
which includes both the conventional FD scheme m and the 
partial correlation concept ca through a suitable selection 
of the CFR model. A key assumption of our analysis is that 
symbol timing and FFO have been previously acquired using 
the classical CP-based method presented in However, since 
multipath propagation may significantly reduce the accuracy 
of the timing estimates provided by ID, a residual timing 
error is included in the adopted system model and numerical 
simulations are used to assess its impact on the overall 
performance. 

The rest of the paper is organized as follows. Next section 
illustrates the signal model and formulates the estimation prob¬ 
lem. Sect. Ill presents the joint ML estimator of the unknown 
parameters using a generic expansion basis to represent the 
CFR. An MMSE reduced-rank approximation of the CFR 
is derived in Sect. IV along with other possible channel 
representations that lead to alternative known PSS detection 
schemes. After discussing numerical simulations in Sect. V, 
we offer some conclusions in Sect. VI. 

Notation: Matrices and vectors are denoted by boldface 
letters, with In being the identity matrix of order N, Ok the 
A'-dimensional null vector and 1 k a A'-dimensional vector 
with unit entries. The notation || • || indicates the norm of the 
enclosed vector, while B 1 and tr{B} are the inverse and 
the trace of a matrix B. We use E{-}, (■)*, (-) T and (•) H for 
expectation, complex conjugation, transposition and Hermitian 
transposition, respectively. The notation 5ie{ } stands for the 
real part of a complex-valued quantity, while | • | represents the 
corresponding modulus. Finally, we use A to indicate a trial 
value of an unknown parameter A. 

II. System model 
A. LTE frame structure 

We consider the DL of an LTE system operating in 
frequency-division-duplexing (FDD) mode. As shown in 
Fig-0 data transmission is organized in radio frame units of 
length 10 ms. Each frame is divided into ten 1 ms subframes, 
which are further partitioned into two slots of length 0.5 ms. 
Every slot contains 7 OFDMA symbols in case of normal CP, 
while 6 OFDMA symbols are present when the extended CP 
is employed to cope with a large delay spread. In the time- 
frequency grid, system resources are divided into resource 
blocks (RBs), such that each RB is composed of 12 contiguous 
subcarriers for a duration of one slot. According to the 
LTE specifications, 504 different physical-layer cell IDs are 
available and arranged into 168 distinct groups. Each group 
is identified by the cell ID group g {0,1,..., 167} and 
contains three different sectors, which are specified by the 
sector ID Njp g {0,1,2}. In addition to acquiring the correct 
timing and frequency synchronization, during the cell search 
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OFDM symbol 


Fig. 1. Position of PSS (green) and SSS (red) in the LTE downlink frame. 


procedure the UE must retrieve the integer-valued parameters 
Njp and Njp, from which the cell ID of the serving eNodeB 
is uniquely computed as Nf^ 1 = 3 Nj^+Njq. To accomplish 
these tasks, two synchronization sequences called PSS and 
SSS are periodically transmitted on a dedicated SCH to specify 
the sector ID and cell group, respectively. In particular, the 
PSS is located in the last OFDM symbol of the first and 11th 
slots of each radio frame, while the SSS is transmitted in the 
symbol immediately preceding the PSS. As LTE allows for 
various system bandwidths, both PSS and SSS are mapped 
onto a 73-subcarriers subband located symmetrically around 
DC, which corresponds to the smallest allowable spectrum 
occupancy. Such design choice enables the detection of the 
synchronization signals without requiring explicit knowledge 
of the system bandwidth. 


B. Primary synchronization sequence 

The PSS is chosen in a set of three different 63- 
length ZC sequences characterized by excellent auto- and 
cross-correlation properties ED- More precisely, during 
the PSS transmission the SCH subcarriers with indices 
n £ {0, ±1, ±2,..., ±36} are modulated by the following 
polyphase complex exponential terms 


a u (n) 


g—j'7ra(n 2 +63n+110)/63 if n £ X 

0 otherwise 


(1) 


where u denotes the root index of the selected ZC sequence 
and X = {n £ Z : |n| < 31 and n ^ 0}. The above equation 
indicates that, out of the 73 available subcarriers, only 62 
are modulated by the PSS, while the remaining eleven (five 
placed at the SCH boundaries and one placed at DC) are left 
unfilled. The root index belongs to the set J u = {25,29,34} 
and specifies the sector ID Njp as shown in Table M 


TABLE I 


r( 2) 


N^ 

Root index u 

0 

25 

t 

29 

2 

34 


C. Overview of the cell search procedure 

Cell search is a three-stage procedure which is performed 
when the UE is switched on or when it loses synchronization. 


Since LTE must provide mobility up to 350 km/h, this process 
is periodically repeated to find a candidate cell for the han¬ 
dover operation. In the first stage, the UE retrieves the FFO and 
acquires coarse information about the OFDM symbol timing. 
This operation is typically accomplished in the time domain 
using the conventional CP-based delay correlation method 
presented in (8). After FFO correction and CP removal, the 
resulting samples are converted in the frequency domain using 
a DFT unit. The second stage detects the position of the PSS 
within the received DL signal and recovers the ZC root index 
u. These tasks can be accomplished either in the time or 
frequency domain, and provide subframe timing information 
as well as the sector ID. In the third stage, the SSS is exploited 
to get the cell ID group and the frame boundary. Recalling that 
the SSS is located in the symbol immediately preceding the 
PSS, the latter is normally used as a phase reference to perform 
coherent detection of the SSS in the frequency domain. As 
for the IFO, it can be estimated either in the second or third 
step by evaluating the frequency domain shift of the received 
PSS or SSS at the DFT output. However, since there are 168 
different secondary sequences compared to only three primary 
sequences, the complexity of the IFO search is greatly reduced 
if it is performed in the second step in conjunction with PSS 
detection. 

D. Signal model and problem formulation 

We now concentrate on the second stage of the cell search 
process and provide the signal model for PSS detection 
and IFO recovery. The DL signal propagates through a 
multipath channel with discrete-time impulse response h = 
[h{ 0), h( 1),..., h(L— 1)] T of order L. At the UE receiver, the 
incoming signal is down-converted to baseband and sampled 
with period T s . Then, the CP delay correlation method J8) is 
applied to retrieve the FFO and the OFDM symbol timing. 
After FFO compensation, timing information is exploited to 
remove the CP and to divide the stream of time-domain sam¬ 
ples into segments x*. = [a;fc(0), x k (l ),... ,x k (N — 1)] (with 
k = 1,2,...) which are subsequently fed to the N —point DFT 
unit. We denote by 

1 N - 1 

X k (n) = -= x k {f)e-^ n ' N (2) 

the nth DFT output corresponding to the kth segment and 
assume that the accuracy of the frequency and timing estimates 
is such that we can reasonably neglect any inter-channel 
or inter-block interference present on X k [n). However, due 
to a possible residual timing error, the quantities X k (n) 
will be affected by a phase rotation that increases linearly 
with the subcarrier index n. Since the PSS is transmitted 
twice per frame, the observation window for PSS detection 
encompasses Nq adjacent OFDM symbols composing a half 
frame, with Nq being either 60 or 70 for the extended or 
normal CP transmission mode, respectively. Without loss of 
generality, in this work we consider an LTE system with an 
extended CP and let Nq = 60. Furthermore, we assume 
that the PSS is transmitted on the qth OFDM symbol within 
the observation window k = 1,2, ..., Nq and denote by 
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Xfc = [X fc (—36),A' fc (—35),...,X fc (36)] the 73-dimensional 
vector collecting the DFT output over the SCH channel. Then, 
for k ^ q we model X fc as a zero mean Gaussian vector 
with covariance matrix = af, I 73 (< 7 ^ being an unknown 
parameter), while for k = q we have 

X q (n) = H{n — v)a u {n — v)e^^ n ~ u ^ 6 ^ N + w q {n) |n| < 36 

(3) 

where v represents the IFO, 9 denotes the residual timing error 
(normalized by the sampling period) and H(n) is the CFR over 
the nth subcarrier. Finally, w q [n ) is the noise contribution, 
which is modeled as a circularly-symmetric white Gaussian 
process with average power cr}} =E{|u>,(n)|“}. Without loss 
of generality, we consider 9 as an integer-valued parameter. 
The reason is that any residual fractional timing offset (FTO) 
can be incorporated into h (£) by letting h(£) = h(t) | t _^ +e ) T , 
where e denotes the FTO and h(t) is the continuous-time 
version of the channel impulse response (CIR). 

Our goal is to exploit the set of observations X = 
[Xf X^ ... X^] T to find the joint ML estimate of the 
unknown parameters {q. a. v\ specifying the PSS position, the 
sector ID and the IFO. Unfortunately, this task is complicated 
by the presence of the nuisance quantities 
where H = {H(n)\ |n| < 31} and er 2 = {cr 2 ; 1 < k < Nq 
and k / q}. In order to reduce the number of estimation 
parameters, we propose to merge H and 9 into an equivalent 
CFR H e , = {H eq (n)]\n\ < 31} with elements H eq (n) = 
H(n)e~P^ ne . Moreover, since channel gains over blocks 
of contiguous subcarriers are highly correlated, we expect that 
H eq can be accurately expanded over a reduced-rank basis of 
P < 63 vectors {bi,b 2 ,...,bp}. This amounts to putting 

He, - B£ (4) 

where B is a matrix with columns b, (i = 1,2,..., P) and 
£ = [£( 1 ), £( 2 ),..., £(P)] T is a vector of expansion coeffi¬ 
cients. A number of possible reduced-rank representations of 
H e , will be presented later. 


III. Joint estimation of the unknown parameters 


We arrange the PSS position, sector ID and IFO into a 
vector p = [q,u, v] and let r/ = [£, cr 2 , cr}}] be the set of 
nuisance parameters. The log-likelihood function (LLF) for 
the unknown quantities {ip, 77 } is obtained from (|3]> as 


A(p,f]) 


Nq 


E 


731 n( 7 T<j|) + 


k =1 
k^q 



^ 36 

731n(7TCT 2 ) - p- I X g( n ) 

° w n=-36 


(5) 
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where we have defined s(£>,£;n) = H eq (n — i>)au(n — v) 

with H e , = B£. The joint ML estimate of the unknown 
parameters is found by looking for the global maximum of 
A (p,f)). Maximizing with respect to a 2 and <r 2 , produces 

^ l|X fc || 2 (6) 


and 


36 


a 2 ,= 


73 


E - s{p,l;n) 


n =—36 


(7) 


These results are substituted back into ([5j> in place of <r 2 and 
ct 2 , yielding 


MM =73 


Nq 

E ln (^n x ^ii 2 )+ ln (^ l|x?H|2 ) 


( 8 ) 


k =1 


-Nq- ln(^ E | X,~(n) 


36 


n =—36 


Skipping irrelevant multiplicative and additive terms indepen¬ 
dent of the optimization variables and exploiting the mono¬ 
tonicity of the log function, we may replace A(<p,£) by the 
equivalent objective function 


Ai(£,i) = - 


36 2 

X q (n) - H eq (n - v)au(n - u) 

n =—36 


• (9) 


To proceed further, we make a change of the indexing variable 
n — v —► to and recall that the PSS values a u (m) are non¬ 
zero only for m £ I as specified in |T|). Hence, assuming that 
\v\ < 5, after dropping an immaterial term in (|9]) we obtain 




25Re 


f 3t j 



£ Z q (u, v\ m)H* q (m) 
lm =-31 J 

\- 

H e , 


X 


q 


2 


( 10 ) 


where we have defined Z q {u,v\m ) = X q (m + k)a^(m). 
Denoting by Z q (u, v) the 63-dimensional vector with entries 
{Z q (u,v\m)\ \m\ < 31}, we may express A 2 (<}?,£) in matrix 
notation as 


2^{'^ H B H Z q {u,D)} ~^B H B£ 

A 2 (£,£) =- 1 --- t / -■ (ID 

X. 


The maximum of A 2 (£>,£) with respect to £ is achieved at 

€ = (B H B)~ 1 B H Z q (u,u) (12) 

and plugging this result back into GD yields the concentrated 
likelihood function 

Z?(u,£)GZ„~(w,£>) 

M<P) = 9 2 J (13) 

X.- 

where G = B (B h B) 1 B H . The joint ML estimate of the 
unknown parameters is eventually obtained as 


p = argmax{A 3 (£>)} (14) 

v 

which requires a search over the multi-dimensional domain 
spanned by p. However, since p = \q, u, v] has integer-valued 
entries, there is only a finite number of hypothesized values of 
p. At this stage, the problem arises as how to select a suitable 
basis {b|, b 2 ,..., bp} for the reduced-rank representation of 
H e , which can exhibit low-sensitivity to the timing error 9. 
Possible solutions are presented in the next Section. 
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IV. Reduced-rank representation of the channel 

FREQUENCY RESPONSE 

A. Problem formulation 

In OFDM systems, the CFR is typically expressed in terms 
of the discrete-time CIR as 

L -1 


H(n) = Kfy 


—j27vn£/N 


(15) 


e=o 


Recalling that H eq (n) = H(n)e i 2 ' n " ne / N , from (15 i it follows 
that the equivalent CFR takes the form 


L+e-i 


H eq {n) = Y, 


—j2nnl/N 


(16) 


e=e 


Since coarse timing recovery schemes for OFDM systems 
adopt a back-off design wherein the timing estimates are 
pre-advanced to avoid inter-block-interference at the DFT 
output ll 22 l . in the sequel it is assumed that there is no 
negative timing error. This amounts to letting 6 vary in the set 
{ 0 , 1 ,... , 0 max }, where 0 max is selected on the basis of the 
accuracy of the timing estimator. In such a case, the equivalent 
CFR can be written in matrix notation as 


H e , = Fhe, (17) 

where h eq = [0^ h 7 oj (j \ r is the equivalent CIR vector 
of order L eq = L + 0 max and F is a 63 x L eq matrix with 
entries 


[F] n>/ = e~ j2 ™ e ' N \n\ <31, 0 < l < L eq - 1 . (18) 


Since in a practical LTE scenario the normalized timing error 
may be as large as 40 m, the channel order L eq is expected to 
be close or even to exceed the value 63. In this case, we cannot 
interpret (17 1 as a reduced-rank representation of H e , and the 
goal is to find a basis {bi, b 2 ,..., bp} of order P <C 63 
such that the orthogonal projection of H e , on the selected 
basis, say Hp = GH eq , is a good approximation of H eq . 
The accuracy of such an approximation can be measured in 
terms of the mean-square-error (MSE) between H e , and Hp, 
which is defined as E{||H», — Hp|| 2 }. Recalling that G = 
B (B P B) 1 B p , from (17 1 the MSE is found to be 


MSE(B) = E{h p F ff G p Fh e ,} (19) 

where we have taken into account that G = I 6 3 — G is 
an idempotent matrix and we have explicitly indicated the 
dependence of the MSE on the expansion matrix B. After 
standard manipulations, we can rewrite © as 

MSE(B) = tr{G p FC e ,F p } (20) 


with C eq =E{h e ,h p } being the covariance matrix of h eq . 


B. MMSE reduced-rank representation 

For a fixed value of P, the optimum expansion basis is 
the one that minimizes MSE(B). Neglecting an irrelevant 
additive term independent of B, it is seen that the minimum 
of MSE(B) is achieved by maximizing the metric 

7 (B) = tr{B(B i/ B)~ 1 B H FC e ,F ff } (21) 


with respect to B. To solve this problem, it is convenient to 
consider the compact singular value decomposition (SVD) of 
B, which is given by 

B = U£V P (22) 


where £ £ R Px P is the diagonal matrix containing the P 
non-zero singular values of B sorted in decreasing order, with 
the columns of U £ C 63xP and V £ C PxP being the corre¬ 
sponding left and right eigenvectors, respectively. Substituting 
(22 1 into (21 1 and observing that U P U = V P V = Ip, after 
some algebraic computations we can rewrite 7 (B) as 


7 (B) =tr{U p FC e ,F p U} 


(23) 


from which we see that the objective function only depends 
on U. Maximizing the right-hand-side of (231 with respect 
to U is a well-known optimization problem, whose solution 
is obtained by selecting as columns of U the P eigenvectors 
of FC f: ,F // associated to the P largest eigenvalues. In the 
sequel, we denote by U mmse the matrix U provided by such 
a design criterion. Since £ and V can be arbitrarily chosen 
without affecting the value of the MSE, for simplicity we let 
£ = V = Ip and obtain B = U mmse ■ 

Unfortunately, computing U mmse requires knowledge of 
C eq , which depends on the channel power delay profile and 
the timing error. Since these quantities are generally unknown 
at the receiver, the exact MMSE solution cannot be pursued in 
practice. A possible way out is found by allowing the system 
to operate in a mismatched mode wherein C eq is replaced 
by I L eq - This leads to an approximated MMSE (AMMSE) 
solution, say B = U ammse , in which the columns of B are 
the P normalized eigenvectors of FF P associated to the P 
largest eigenvalues. In such a case, the concentrated likelihood 
function in ( fl3| becomes 

IIt - 1 'ammse F)|| 


a 3 (<p) = 


(24) 


C. Polynomial-based reduced-rank (PRR) representation 
An alternative reduced-rank representation of H e , can be 
obtained by approximating the CFR with a (P — 1)—order 
polynomial function as 

p 

H eq (n) \n\ < 31. (25) 

P =i 

This amounts to putting B = Bppp, where Bppp is a 63 x P 
matrix with entries 

[Bppp]„, p = n p_1 \n\ <31, 1 < p < P. (26) 


D. Piecewise-constant reduced-rank (PCRR) representation 
The piecewise-constant reduced-rank representation 
(PCRR) of the CFR is obtained by arranging the 63 SCH 
subcarriers into P adjacent subbands and assuming that 
H eq {n) is approximately constant on each subband. Hence, 
denoting by K p the number of subcarriers contained in the 
pth subband (with p = 1 , 2 ,..., P), we have 

H eq (n) ~ £(p) J p -i - 31 < n < J p - 32 (27) 
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where 


v 

J'p — ^ ^ Krr 
m=l 


1 <p< p 


(28) 


and Jo = 0. If P is an integer divider of 63, all subbands have 
the same number of subcarriers K p = 63/ P. Otherwise, we let 
63 = MP+R, where M =int{63/P} and R £ {1,2,... ,P— 
1} are the quotient and remainder of the integer division 63/P, 
respectively. Then, the size of the P subbands are designed 
such that K p = M + 1 for 1 < p < R and K p = M for 
R,+ \ < p < P . In matrix notation, H,., y can be written as in 
0 after letting B = Bpqrr , where Bpcrr is the 63 x P 
following matrix 


B 


PCRR = 


1 K l 

0/Y, 

0 Ki 

0 k 2 

1 k 2 ■ ■ 

0 k 2 

0 K P 

O 

* •* 

1 Kp 


(29) 


In such a case, the concentrated likelihood function in ( fL3j ) 
becomes 




ixjir u K v 


J v - 32 


Y Zg(u, Z>; n) 


n=J v -1 — 31 


(30) 


which is reminiscent of the partial correlation method pre 
sented in fT 8 l . Interestingly, letting P = 1 in ( [30] ) yields 

2 

1 

l2 


A 3 (¥>) = 


63 IIXq 


31 


£ ^ 

n=—31 




>;n) 


(31) 


which coincides with the conventional frequency-domain 
correlation-based (CFDC) metric originally proposed in lfl4ll 
for the joint estimation of u and v. 


E. Complexity issues 

The processing load of the estimator ( |T 4 | depends on which 
reduced-rank representation is adopted for the CFR. For both 
the AMMSE and PRR solutions, the metric A 3 (y?) can be 
efficiently computed as 


a 3 (<Z>) = 


C h B h Z g(u,i>)\ 


(32) 


where CC H = (B h B 


, -1 


is the Choleski decomposition of 
(B h B) , while for the PCRR it is convenient to evaluate 
A .3 (<^>j as indicated in (30). In any case, for each new received 


OFDM symbol, the receiver must compute the quantities 
11X-11 and Zq(u,D). Assuming that the entries of X- are 
available, 291 floating-point-operations (flops) are required to 
get 11X-11 , while 372 flops are needed to evaluate Z q (u,u) 
for each couple (u, v). When using the AMMSE solution, the 
numerator of A 3 (<^>) in (32) is computed with 498P flops 
for each (u, v). This figure reduces to 250P flops with the 
PRR approximation as in this case the matrix C H B H is real¬ 


valued. As for the PCRR, evaluating A 3 (£>) in (30i starting 
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from ||X-1|“ and Z q {u,i>) needs 123 ± 3P flops for each 
couple (u, v\ which reduces to 126 flops when considering 
the CFDC. The overall complexity of the investigated schemes 


is summarized in Table [II] for each received OFDM symbol. 
In writing these figures we have borne in mind that ft varies in 
the set {25,29,34} and we have denoted by N v the number 
of different values of v. 


TABLE II 

Complexity of the investigated schemes in terms of number of 
FLOPS FOR EACH OFDM SYMBOL. 


Algorithm 

Required flops 

AMMSE 

291 + 1116 N v + 1494ALP 

PRR 

291 + 1116AL + 750 N V P 

PCRR 

291 + 14851W + 9ALP 

CFDC 

291 + 1494AL 


V. Simulation results 

Computer simulations have been run to assess the per¬ 
formance of the presented PSS detection and IFO recovery 
schemes using different reduced-rank representations of the 
CFR. The LTE simulation set-up is chosen according to the 
3GPP specifications ED and is summarized as follows. 

A. Simulation model 

We consider a 20 MHz LTE communication system with 15 
kHz subcarrier spacing. At the receiver side, the baseband sig¬ 
nal is sampled with frequency f s = 30.72 MHz and converted 
in the frequency-domain through a 2048-point DFT unit. To 
demonstrate the capability of the investigated scheme in a chal¬ 
lenging scenario, we adopt the Extended Typical Urban (ETU) 
channel model characterized by 9 channel taps with maximum 
excess delay r max = 5 ps. The path gains are modeled 
as statistically independent random variables with zero-mean 
and Gaussian distribution (Rayleigh fading). A raised-cosine 
functions with roll-off 0.22 and time-duration of 6 sampling 
periods is employed for the pulse shaping, which corresponds 
to an overall CIR of order L =int{/ s r max + 6} = 160. 
Assuming a maximum normalized timing error 0 max = 40 
m, the length of the equivalent CIR vector h eq is found to be 
L + ((max = 200. However, since parameter P (and the system 
complexity) is expected to increase with the CIR duration, 
in the design of the AMMSE we use L eq = 120 , which 
amounts to reducing the size of matrix F from 63 x 200 to 
63 x 120. This choice is motivated by the fact that in the ETU 
channel model there is only one multipath component with a 
path delay greater than 2.3 ps which, moreover, collects less 
than 3% of the average channel power. The value L eq = 120 
is also compliant with other LTE channel models, such as 
the Extended Vehicular A (EVA) and Extended Pedestrian A 
(EPA), which are characterized by a maximum excess delay 
of 2.51 ps and 0.41 ps, respectively. 

Without any loss of generality, we adopt the extended CP 
transmission mode wherein 6 OFDM symbols are present in 
each slot. Since the PSS is transmitted every 10 slots, in our 
simulations we let Nq = 60. The range of CFO values is 
related to the oscillator instability, while it is only marginally 
affected by the UE mobility. Hence, assuming that the stability 
of commercial oscillators for mobile applications is in the 
order of ±10 parts-per-million (ppm) at both the transmit and 
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receive ends, the maximum CFO value is approximately 2.66 
subcarrier spacing at the carrier frequency of 2 GHz. Accord¬ 
ingly, the search range for the IFO is J v = {0, ±1, ±2, ±3}, 
which amounts to putting N v = 7. Recalling that the PSS is 
chosen from a set of three possible ZC sequences with root 
index u € {25, 29,34}, the overall search space for the triplet 
c p = [q, u, v\ has cardinality = 3 N q N v = 1260. 

The accuracy of the PSS detection and IFO recovery 
schemes is measured in terms of the error rate incurred in 
the estimation of each parameter of interest q, u and v. As 
a global performance indicator, we also consider the overall 
probability of failure Pf = Pr{£> f ip}. 


B. Performance evaluation 

Fig. illustrates the quantity MSE(B) reported in ( |20| ) 
as a function of P for the investigated reduced-rank CFR 
representations. As expected, the best results are obtained 
with B = U mmse since this choice minimizes MSE(B) for 
a fixed value of P. Compared to the true MMSE solution, 
the AMMSE is characterized by a higher MSE, even though 
it largely outperforms both the PRR and PCRR schemes. 
Although the accuracy of the considered reduced-rank repre¬ 
sentations steadily improves with P, the probability of failure 
Pf cannot exhibit a similar behaviour. The reason is that B 
has dimension 63 x P and, accordingly, it becomes a square 
matrix when P = 63. In the latter case, G = B (B ff B) B ff 
reduces to Ik 3 and G L = I fi3 G is therefore the null matrix. 


This situation has two consequences. On one hand, from (201 
we see that MSE(B) = 0, thereby justifying the fact that the 
MSE decreases as P approaches the value 63. On the other 
hand, substituting G = Ig 3 into (fT3j) leads to 


a 3 (<p) = 


(33) 



Fig. 2. MSE(B) vs. P for different reduced-rank CFR representations. 


a, 



or, equivalently. 


a 3 (<p) 


31 


E 

m =—31 


\Xq{m + F)| 2 


ll x ?f 


(34) 


This equation indicates that the metric A 3 (y>) becomes inde¬ 
pendent of u when P = 63, thereby preventing any possibility 
to recover the sector index. From the above discussion it 
follows that parameter P must be judiciously designed so as 
to meet two conflicting requirements. On one hand, it must 
be large enough to produce a sufficiently accurate reduced- 
rank representation of the CFR. On the other hand, it must be 
adequately small since otherwise the estimation problem con¬ 
tains too many unknown quantities which cannot be estimated 
reliably. 

This intuition is corroborated by the results of Fig. [3] 
where Pf is shown as a function of P. Here, the signal- 
to-noise ratio (SNR) is fixed at 8 dB and the timing error 
is 6 = 9 max = 40. As is seen, for each curve there is an 
optimum value of P which minimizes Pf. Since the minimum 
is approximately attained at P = 5 by all the considered 
reduced-rank representations, such a value is used in the 


Fig. 3. Probability of synchronization failure vs. P for SNR = 8 dB and 
6 = 40. 


subsequent simulations except for the CFDC scheme, which 
is obtained from the PCRR by letting P = 1. 

Fig. [4] illustrates the error rate incurred in detecting the 
presence of the PSS, say P q = Pr{g 7^ q}, vs. the SNR 
for 9 = 40. In addition to the investigated schemes, we also 
consider the PSS differential detector (DD) proposed in lfl4l . 
which provides an estimate of the unknown parameters [q. u, v\ 
by looking for the maximum of the following metric 

^ e { E Zg(u,i>;n)Z%(u,i>;n- 1)1 
A dd(v>) = - l!l = z30 - 2 -- (35) 

in which the quantities Zq{u,u\n) and Z^(u,u;n — 1) ob¬ 
tained from two adjacent frequency bins are multiplied in 
order to mitigate the impact of the timing error and channel 
selectivity on the system performance. As expected, AMMSE 
outperforms all the other methods and achieves a gain of 
approximately 2 dB over the DD, which further increases to 

























2.5 dB and 3 dB with respect to PCRR and PRR, respectively. 
Due to its high sensitivity to timing errors and channel 
distortions, the CFDC does not work properly in such adverse 
scenario, and exhibits an error rate which is close to one at 
all SNR values. 




AMMSE over the other methods. In particular, at SNR=10 
dB we see that the probability of failure of AMMSE is 10 -3 , 
while it increases to 10 -1 with PCRR, PRR and DD. 


Figs. [5] and [6] show the error rate incurred in the estimation 
of the sector index and IFO, respectively, say P u = Pr{u ^ u } 
and P v = Pr{i> / v }. The operating scenario is the same 
as that in Fig. [4] and also the trend of the curves is similar, 
with AMMSE outperforming the other schemes. It is worth 
noting that the use of AMMSE is more advantageous for the 
estimation of the IFO than for the other two parameters q and 
u. 



Fig. 5. Error rate incurred in the detection of the sector index u vs. SNR 
for 6 = 40. 



Fig. [8]illustrates Pf as a function of 6 when the SNR is fixed 
to 8 dB. These results are useful to assess the sensitivity of the 
considered schemes to residual timing errors and demonstrate 
the remarkable robustness of both DD and AMMSE to such 
an impairment. In contrast, the probability of failure of PCRR, 
PRR and CFDC rapidly deteriorates as 9 increases. This is 
particularly evident for the CFDC, which cannot be used when 
the timing error exceeds a few sampling periods. 


The overall system performance is summarized in Fig. [7] 
where Pf is shown as a function of the SNR with the timing 
error being fixed to 9 = 40. These measurements validate 
the results of Figs. 4][6 and clearly indicate the superiority of 


C. Complexity comparison 

We conclude our study by comparing the investigated 
schemes in terms of their computational complexity. From 
the results reported in Table [II] it turns out that, in the 
considered scenario with N v = 7 and P = 5, the number of 
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Fig. 8. Probability of synchronization failure vs. 6 for SNR = 8 dB. 


required kiloflops (kflops) for each received OFDM symbol is 
approximately 60 for the AMMSE method, 34 for PRR and 11 
for both PCRR and CFDC. As for the DD scheme, computing 
the numerator of A dd{<p) in (35 1 starting from Ziq(u, v) needs 
245 flops for each couple (u, v) which, after including the 
computational burden required to get the quantities ||X. || 
and Z q(u,i>), leads to an overall processing requirement of 
13 kflops. These figures indicate that the improved accuracy 
of the AMMSE is achieved at the price of a higher system 
complexity. In particular, the number of flops is increased by 
a factor of two with respect to PCRR, and by a factor 5.5 with 
respect to the other methods. However, considering the fact 
that in the investigated simulation set-up the AMMSE is the 
only scheme that can ensure acceptable error rate performance, 
the penalty in terms of computational load seems tolerable. 


VI. Conclusions 

We have presented an ML approach for joint PSS detection, 
sector index identification and IFO recovery in the downlink 
of an LTE system. The proposed scheme (AMMSE) operates 
in the frequency-domain and is based on a novel MMSE 
reduced-rank representation of the channel frequency response 
over the SCH subcarriers, which mitigates the impact of 
multipath distortions and residual timing errors on the system 
performance. Compared to existing schemes available in the 
literature, our method exhibits improved accuracy at the price 
of a higher computational complexity. The penalty in terms 
of required flops is justified by the fact that AMMSE can be 
used in a harsh environment characterized by prolonged delay 
spreads and non-negligible timing errors, where other compet¬ 
ing schemes provide unacceptable performance. Furthermore, 
it represents a promising candidate for future heterogeneous 
networks in which the coexistence of femto-, pico- and macro¬ 
cells will require fast and successful detection of neighboring 
cells for efficient interference management and fast handover 
operations. 


References 

[1] E. Dahlman, S. Parkvall, J. Skold, and R Beming, 3G Evolution: HSPA 
and LTE for mobile broadband, Academic Press Inc., 2008. 

[2] 3GPP TS 36.101 v8.9.0 3rd Generation Partnership Project; technical 
specification group radio access network; evolved universal terrestrial 
radio access (E-UTRA); user equipment (UE) radio transmission and 
reception (release 8), Techn. Rep., Dec. 2009. 

[3] H. G. Myung and D. J. Goodman, Single Carrier FDMA - A new air 
interface for long term evolution. New York: John Wiley and Sons, 2008. 

[4] E. W. Yi-Pin and O. Tony, ’’Cell search in W-CDMA”, IEEE J. Sel 
Areas Commun., vol. 18, n. 8, pp. 1470-1482, Aug. 2000. 

[5] M. M. Wang, A. Agrawal, A. Khandekar, and S. Aedudodla, ’’Preamble 
design, system acquisition, and determination in modem OFDMA cel¬ 
lular communications: an overview”, IEEE Commun. Mag., vol. 49, n. 
7, pp. 164-175, July 2011. 

[6] 3GPP TS 36.211 V8.7.0, ’’Physical channels and modulation”, 2009. 

[7] Y. Tsai, G. Zhang, D. Grieco, F. Ozluturk, and X. Wang, ’’Cell search 
in 3GPP long term evolution systems”, IEEE Veh. Technol. Mag., pp. 
23-29, June 2007. 

[8] M. Sandell, J. J. van de Beek, and P. O. Borjesson, ”ML estimation of 
time and frequency offset in OFDM systems”, IEEE Trans, on Signal 
Processing, vol. 45, n. 7, pp. 1800-1805, July 1997. 

[9] H. Setiawan and H. Ochi, ”A low complexity physical-layer identity 
detection for 3GPP Long Term Evolution”, in Proc. of the 12th Int. 
Conf. on Advanced Commun. Techn. (ICACT) 2010. 

[10] B. M. Popovic and F. Berggren, ’’Primary synchronization signal in E- 
UTRA”, in Proc. of 10th Int. Symp. on Spread Spectrum Techn. and 
Appl. (ISSSTA) 2008, pp. 426-430, 2008. 

[11] Y. Yu and Q. Zhu, ”A novel time synchronization for 3 GPP LTE cell 
search”, in Proc. of 8th Int. Conf. on Commun. and Net. in China, 2013. 

[12] Z. Zhang, J. Liu, and K. Long, ’’Low-complexity cell search with fast 
PSS identification in LTE”, IEEE Trans, on Vehic. Technology, vol. 61, 
n. 4, pp. 1719-1729, May 2012. 

[13] Y. Gao, G. Zhu, X. Chen, D. Wu, and B. Ban, ”A modified algorithm 
of synchronization signal detection for LTE initial cell search”, in Proc. 
of 6th Int. Conf. on Commun. and Net. in China, 2011. 

[14] K. Manolakis, D. M. G. Estevez, V. Jungnickel, W. Xu, and C. Drewes, 
”A closed concept for synchronization and cell search in 3GPP LTE 
systems”, in Proc. of Wireless Commun. and Net. Conf. (WCNC) 2009, 
Budapest, Apr. 2009. 

[15] Y. H. Tsai and T. H. Sang, ”A new timing synchronization and cell 
search procedure resistant to carrier frequency offsets for 3GPP-LTE 
downlink”, in Proc. of 1st IEEE Int. Conf. on Commun. in China, pp. 
334-338, 2012. 

[16] S. L. Su, Y. C. Lin, and Y. J. Fan, ’’Joint sector identity and integer part 
of carrier frequency offset detection by phase-difference in Long Term 
Evolution cell search process”, IET Communications, vol. 7, n. 10, pp. 
950-959, 2013. 

[17] A. R. Elsherif and M. M. Khairy, ’’Adaptive primary synchronization 
signal detection for 3GPP Long Term Evolution”, in Proc. of 9th Int. 
Wireless Commun. and Mobile Comp. Conf, 2013. 

[18] L.-C. Wung, Y.-C. Lin, Y.-J. Fan, and S.-L. Su, ”A robust scheme in 
downlink synchronization and initial cell search for 3GPP LTE systems”, 
in Proc. of 6th Int. Symp. on Wireless and Pervasive Computing (ISWPC) 

2011, pp. 1-6, 2011. 

[19] M. Morelli and M. Moretti, ”ML estimation of timing, integer frequency 
and primary sequence index in LTE systems”, in Proc. of Int. Conf. on 
Commun. (ICC), 2015. 

[20] M. Morelli and M. Moretti, ’’Carrier Frequency Offset Estimation for 
OFDM Direct-Conversion Receivers,” IEEE Trans. Wireless Commun., 
vol. 11, no.7, pp. 2670-2679, July 2012. 

[21] B. M. Popovic, ’’Generalized chirp-like polyphase sequences with opti¬ 
mum correlation properties”, IEEE Trans. Inf. Theory, vol. 30, n. 3, pp. 
769-777, July 1992. 

[22] H. Minn, V. K. Bhargava, and K. B. Letaief, ”A robust timing and 
frequency synchronization for OFDM systems”, IEEE Trans. Wireless 
Commun., vol. 2, n. 4, pp. 822-839, July 2003. 

[23] P. Y. Tsai and H. W. Chang, ”A new cell search scheme in 3GPP Long 
Term Evolution downlink OFDMA systems”, in Proc. of 12th Int. Conf. 
on Wireless Commun. and Signal Proc., 2009. 

[24] X. Yang, Y. Xiong, G. Jia, W. Fang, and X. Zheng, ”PSS based time 
synchronization for 3GPP LTE downlink receivers”, in Proc. of 13th Int. 
Conf. on Commun. Techn. (ICCT) 2011, pp. 930-933, 2011. 

[25] C. Y. Chu, I. W. Lai, Y. Y. Lan, and T. D. Chiueh, ’’Efficient sequential 
integer CFO and sector identity detection for LTE cell search”, IEEE 
Wireless Commun. Letters, vol. 3, n. 4, pp. 389-392, Aug. 2014. 


















10 


[26] Y. Shen, T. Luo, and M. Z. Win, ’’Neighboring cell search for LTE 
systems”, IEEE Trans, on Wireless Commun., vol. 11, n. 3, pp. 908- 
919, March 2012. 

[27] I. T. S. Sesia and M. Baker, LTE - the UMTS Long Term Evolution: 
from theory to practice , New York: John Wiley and Sons, 2009. 



